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The Solovay-Kitaev algorithm is the standard method used for approximating arbitrary single- 
qubit gates for fault-tolerant quantum computation. In this paper we introduce a technique called 
search space expansion, which modifies the initial stage of the Solovay-Kitaev algorithm, increasing 
the length of the possible approximating sequences but without requiring an exhaustive search over 
all possible sequences. We show that our technique, combined with a GNAT geometric tree search 
outputs gate sequences that are almost an order of magnitude smaller for the same level of accuracy. 
This therefore significantly reduces the error correction requirements for quantum algorithms on 
encoded fault-tolerant hardware. 



INTRODUCTION 

The biggest challenge to building working quantum com- 
puters is the problem of fault-tolerance pQ. Unless a 
quantum computer is specifically designed and built to 
withstand the effects of errors from environmental de- 
coherence and inaccurate hardware operations, it will 
not be able to perform computations of any significant 
size [2]. One prominent way to control errors is to cor- 
rect them when they arise, by encoding quantum data 
in many physical qubits. If the underlying hardware is 
accurate to "threshold" value, then these error correc- 
tion codes can in principle keep an arbitrary computation 
error- free given enough physical resources [3HS]. 

One of the drawbacks to error correction codes, how- 
ever, is that only a very small number of logical gates are 
available to the code. To implement a given algorithm, 
the gates in the algorithm must be decomposed into the 
fundamental "library" gate set available to the code. In 
general this cannot be performed exactly - we must look 
for sequences of library gates that approximate the gates 
we require [BJ. 

The key problem when finding an approximation to an 
arbitrary gate is the size of the space over which it is nec- 
essary to search in order to find a good approximation. In 
the general case, the longer the sequence of library gates 
(and their inverses), the better an approximation to the 
gate in question can be found. However, high-accuracy 
exhaustive searches become untenable on current compu- 
tational technology. The Solovay-Kitaev approximation 
algorithm was introduced to get around this difficulty 
[BJ [7J . This recursive algorithm performs an exhaustive 
search only over the space of sequences of length up to Iq, 
finding the sequence with the smallest distance (defined 
by the trace norm) from the gate we wish to approximate. 
The residual difference between the actual and approxi- 
mate gate is then sent to the next level of the algorithm, 
where it is further approximated. At each level the length 
of the gate sequences grows by a factor of 5. The algo- 
rithm terminates at a "good" approximation, where the 
distance to the actual gate is less than a chosen constant 
e. 



While very powerful, the Solovay-Kitaev algorithm suf- 
fers from a serious weakness. While it will always find a 
good approximation for any value of e, the search covers 
only a very sparse region of the entire space of possible 
approximation sequences. As a consequence, the out- 
put from the approximation is almost always far longer 
than it needs to be. This is an extreme disadvantage 
for fault-tolerant computation, as this greatly increases 
the logical depth of the computation, thus increasing the 
amount of error correction required, which in turn in- 
creases the physical size and run time of the algorithm. 
Reducing these requirements as far as possible is key to 
implementing realistic fault-tolerant quantum computa- 
tion. 

Currently there are two alternatives to the original 
Solovay-Kitaev algorithm. The first is an exhaustive- 
search algorithm that, in certain cases, can give a very 
efficient library-gate decomposition [8]. However, in the 
general case the scaling is exponential as in a standard 
exhaustive search and therefore limited by computational 
ability. The second is the "phase kickback" method orig- 
inally given in [7J. This can produce shorter gate se- 
quences by using special ancilla states, but requires many 
more qubits for the ancilla and is also restricted in the 
library gate set it can use. 

In this paper we describe an alternative gate decomposi- 
tion algorithm that modifies the original Solovay-Kitaev 
method and allows a much denser search of the space 
of sequences. The key to improving the algorithm is to 
concentrate on the initial approximation, and we dra- 
matically increase the search space by combining two se- 
quences out of the database of initial sequences. This 
expansion technique can be implemented recursively, cre- 
ating an even better initial approximation. To make this 
search step computationally tractable, we have applied a 
geometric nearest-neighbour access tree search procedure 
(GNAT) [5]. This procedure is a fully general method for 
any gate and any library gate set, and with high probabil- 
ity produces approximations that are significantly shorter 
than those given by the original algorithm, for the same 
precision. 

The paper is structured as follows. Section 2 reviews 
the Solovay-Kitaev theorem, along with with the original 
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implementing algorithm proposed in [SJ [7] . In Section 3 
we introduce the search space expansion technique (SSE) 
that we will use to supplement the original algorithm. We 
show that by splitting each candidate sequence and then 
searching points near to the subsequences, we can gen- 
erate candidate sequences that with high probability are 
a better approximation but which are shorter than those 
at the next level of recursion. One potential issue with 
this approach is an added classical search requirement 
over the sampled sequences, and in Section 4 we describe 
a way of efficiently searching these sequences to reduce 
the time required compared with a standard exhaustive 
search. In Section 5 we compare the original Solovay- 
Kitaev algorithm with our modified algorithm for a set 
of 25 single-qubit gate unitaries chosen at random. On 
average, the length of the candidate sequence reduces by 
a factor of 7, and for some sequences it can reduce it by 
an order of magnitude. The number of levels of recursion 
in the algorithm is also greatly reduced, with generally 
only two or at most three levels being needed even for 
very high accuracies. 



THE SOLOVAY-KITAEV APPROXIMATION 

The Solovay-Kitaev theorem tells us we can always ap- 
proximate a single-qubit gate G to arbitrary accuracy e 
with a finite sequence of fundamental gates from a uni- 
versal "library set" of gates {L\, L2, ■ ■ ■ Lm} and their 
adjoints. The Solovay-Kitaev algorithm gives a method 
of finding what these sequences are, for a given G, e, and 
{Li}. 

The Solovay-Kitaev theorem is invoked as part of the 
compilation process for fault-tolerant quantum comput- 
ing. The library set on a fault-tolerant computer is in 
general very limited, with only a handful of gates able to 
operate within the code space. For example, in the sur- 
face code we have access only to a fundamental library 
of four gates: {CNOT , H, S, T} (where S is the phase 
gate and T a Z-rotation of 7r/4)[TD]. In general, the li- 
brary contains a 2-qubit entangling gate such as CNOT or 
controlled-phase, and then some single-qubit gates. Any 
multi-qubit gate can be decomposed exactly into a combi- 
nation of a 2-qubit maximally entangling gate plus arbi- 
trary single qubit rotations , so this becomes the first 
stage of compilation for an algorithm. The next stage 
is then the further task of decomposing these arbitrary 
single qubit gates into the single qubit gates of the error 
correction code library. 

The Solovay-Kitaev theorem states that, for a given gate 
67, accuracy e, and library gate set {Li}, there always ex- 
ists a sequence of library gates (lA =1 Aj | Aj E {Li}{L\}) 
such that 

||G-n ; AI|<e (i) 



using the standard operator trace norm distance 

\\M-N\\ = Tr ^{M - AT)t(M - N)) (2) 

All gates are represented here by unitary operators in 
SU(n), where n is the dimensionality of the gate. The 
theorem further states that the length of the sequences, I, 
varies with the required accuracy e as I = O {log c (1/e)). 
Exactly what the constant c is depends on the partic- 
ular implementation of the decomposition. It is known 
that the best possible scaling is c = 1, but with a non- 
constructive proof [IS]. The standard algorithm gives a 
scaling of sequence size with accuracy of c s» 4 [T3] . 
The most straightforward procedure for performing such 
a decomposition is to search over all sequences, begin- 
ning with the shortest first, until one is found within 
the required e of the gate G being decomposed. Unfor- 
tunately, such an exhaustive search becomes untenable 
very quickly. For a library of n fundamental gates, then 
the number of sequences of length I comprising these li- 
brary gates and their adjoints is 2n l . For example, if we 
have a library of 5 fundamental single-qubit gates (as, 
for example, in the surface code: H, S, ,T,T^) then a 
modern server with 64GB memory could hold only up 
to sequences of length I sa 13. Searching over this size 
of database is also a significant classical processing task. 
The longer the sequences are then the higher the chance 
of finding a sequence within e of G; without the ability 
to search longer sequences, the correct accuracy may be 
unobtainable. 

The standard Solovay-Kitaev algorithm uses such an ex- 
haustive search technique at its base layer, but then 
builds on that recursively. We can describe the algorithm 
in iterative fashion as follows. The base level approxima- 
tion comprises a search over the space of all sequences of 
length up to Iq. The closest approximation to G is found, 

S(0) = Il l !L 1 A j \A j e{L i }U{Lt} (3) 

We can then decompose the gate as G = U(S)S(0). The 
operator U(5) is the "residual" of the approximation: 
how far away from G the operator sequence S(0) still 
is. 

If I |G — 5(0)|| < e then the algorithm terminates here 
and returns S(0) as the appropriate gate sequence. If 
the residual U(S) is too great, however, the algorithm 
proceeds to the next level. A further exhaustive search 
of the space of sequences of length up to Iq is performed, 
this time in order to find the best approximation to U (5) . 
A subtlety at this step in the algorithm is that we do not 
have a closed form for U (5) , so need to find an approxi- 
mation for GS^(0) instead. The algorithm performs this 
by decomposing further into VWV^W^ = GS^(0) where 
V and W are the unitary gates that are then searched 
for. The sequence VWV^W^ that is closest to U(5) is 
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then returned by the search, and so the first-level approx- 
imation becomes 

5(1) = VWV*Wi 5(0) (4) 

Note that V, W, 5(0) are all sequences of length up to l ; 

the sequence 5(1) is therefore of length up to 51q. 

If \\G — 5(1)|| < e then the algorithm terminates and 

returns 5(1). If not, the previous step is repeated to find 

a decomposition of the residual U(6i) = GS'(1). This 

is repeated until a sequence of the desired accuracy is 

found. 

As we can see from equation Q, in the standard Solovay- 
Kitaev algorithm, the length of the approximating se- 
quence grows by a factor of 5 at each level of recur- 
sion. As a result, the algorithm can only produce the 
approximating sequences of the length in the set 5; = 
{lo, 5lo, 25Zo, 125Zo, ■■■}, where Iq is the length of the ap- 
proximating sequence for the basic stage of the decompo- 
sition algorithm. Therefore, the vast majority of possible 
approximating sequences, which are not in Si, cannot 
be generated by the algorithm. Furthermore, the best 
strategy for finding a short gate sequence is to make the 
initial approximation length ^o as large as possible; how- 
ever, this then means that at every step in the algorithm 
the length of the approximating sequence grows dramat- 
ically. Because of this, the most efficient approximating 
sequences are likely to be missed, leading to much longer 
sequences than are necessary to reach the desired accu- 
racy of approximation. 

SEARCH SPACE EXPANSION 

The technique we will use is to expand the search space at 
the first level of the standard algorithm so that sequences 
that are longer than Iq are also covered. We assume that 
lo is the longest possible sequence space that our compu- 
tational resources can exhaustively search. By improving 
the accuracy of this initial approximation, eo, we reduce 
the residual to be approximated at the next level of the 
algorithm. It is then intuitively reasonable that this will 
reduce the number of recursion levels implemented to find 
a sequence accurate to a given e. Formally, the residual 
error at recursion level n is given by 

e»=i (5) 

(with c as above) [13] . We can therefore conclude that the 
more we reduce eo, the smaller will be e„; and therefore 
the sooner the algorithm will find a sequence e„ < e. 
The initial stage of the algorithm gives us the first ap- 
proximation gate sequence 5(0), equation ([3]). The resid- 
ual distance, eo, from the exact gate G is 

\\G- 5(0)|| <e (6) 




FIG. 1: Expanding the search space. Sequence 5(0) (a 
good first approximation to gate G) is split into two 
halves, 5(0) = 5^ (0)5^(0). Points in regions of 
distance eo (shaded) away from 5^(0), 5^(0) are 
recombined, and the closest to G, Z(0) is chosen. With 
high probability, ||G-5(0)|| > \\G - Z(0)\\. 

We now start our space expansion technique. For simplic- 
ity, we partition the sequence 5(0) into two equal halves 
(in fact, the procedure can be performed by splitting the 
sequence into any number of parts, which may be un- 
equal). Each of these subsequences 5^(0), 5^(0) is of 
length Iq/2. For example, for a sequence 5(0) = HT^S^T 
we would have 5«(0) = HI* 5^(0) = S^T. 
We now search once again over the space of sequences 
of length Iq [2D] to find approximations {Z^\o)} and 
{Zf } (0)} within eo of 5^(0) and 5^(0): 

ViM-S^CO) -^(o)!! < eo 
Vj||5( 2 >(0)-£f(0)|| < eo (7) 

where we define the search regions by 

eo = 0.5e (8) 

Figure [T] shows this procedure schematically. 
We now form the set of sequences 

{Z^0) = zf>(0)Zf (0)} (9) 

These sequences are of length 2/ 1; and are derived from 
5(0). We then search the sequences Zij(0) to find the 
one, Z(0), with minimal distance to the actual gate G, 

\\G-Z(0)\\ =e min (10) 

We now wish to show that, in general, 

£min < e(0) + e(0) = e(0) (11) 

that is, that we have found a sequence Z(0) of length 21 1 
that is a closer approximation to G than was the sequence 
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Algorithm 1: Space_Expansion 

Input: G G SU(2): Target of approximation 
Input: S: Universal set of gates 

Input: 5*o : Stored set of sequences of gates from S of length Iq 
Input: Si: Stored set of sequences of gates from S of length fa 
Input: eo: Accuracy of initial approximation 
Input: eo: Accuracy of loop-internal approximation 

Input: k: The desired cardinality of the set to be returned, k = 1 when Space_Expansion is used as a standalone 

algorithm, k > 1 when Space_Expansion is used as a component of Recursive_Space_Expansion 
Output: k approximating sequences for G of length 2h 

1 Space_Expansion (G, S, So, Si, eo, eb, k) 

2 7? <S— {r G Sq\ distance(r, G) < e }; 

3 foreach r G R do 

4 Split r into 2 sub-sequences of the same length called r pre and r BU f, 

5 Ri <— {ri G Si\ distance(ri, r prc ) < eo}; 

6 Ri «- {r 2 G Sij distance(r 2 ,r suf ) < el)}; 

7 Join i?i and i? 2 to have the following set: 

8 7? 3 {rir 2 \ n G i?i and r 2 G 7? 2 }; 

9 end 

10 i?4 A; best approximations for G G -R3; 

11 return R4 



S(0) of length l , given by the original Solovay-Kitaev 
algorithm. To do this, consider any four gate sequences 
tA.,2, V\ : 2- Given that 

\\U l -V 1 \\<e 1 and \\U 2 - V 2 \\ < e 2 (12) 

we have that 

\\U l U 2 -V 1 V 2 \\ < WUM-ViUiW + WVM-ViVaW, (13) 
therefore 

\\U X U 2 - V X U 2 \\ + \\ViU 2 - ViV 2 \\ 
- H^i-Vill + HUb-^all 
< £i +«2- (14) 

We can therefore conclude that 

\\U 1 U 2 -V 1 V 2 \\<e 1 +e 2 . (15) 

Since ei and e 2 are relatively small (Ul is a close ap- 
proximation for Vi and E/2 is a close approximation for 
V 2 ), it is possible to say that a sequence UiU 2 is a rela- 
tively close approximation for V\V 2 . If each U £ SU (2) is 
presented by the specified vector u 6 M 3 or equivalently 
specified point in 3-D space (see below for further details 
of this mapping), U\U 2 is in the sphere of centre V\V 2 
and radius ei + e 2 . With a large number of sequences U\ 
and U 2 and their combinations U\U 2 , the probability of 
having a very close approximation for V\V 2 is relatively 
high. 

We therefore conclude that 

e mi „ < c(0). (16) 



for our new approximating sequence Z(0) - that is, we 
have strictly reduced the error in approximation by dou- 
bling the length of the approximating sequences, but not 
going on to the next level of recursion of the standard 
algorithm, which would increase it by a factor of 5. 
This technique as it stands we call search space expansion 
(SSE). At the initial stage of the SK algorithm the search 
space is expanded once, which significantly reduces the 
residual and hence the number of subsequent levels of 
recursion needed. Algorithmically, it can be written in 
the pseudocode form given in Algorithm 1. 
We can, however, also apply SSE itself recursively at this 
initial stage, to get an even better S(0) approximation. 
Rather than performing a standard search over sequences 
of length l\ to find our approximations for (0) and 
S( 2 )(0) (equation 0, we use SSE itself to find better ap- 
proximations. So we use 5^(0) and 5^(0) as query 
gates for 2 SSE procedures, remaining the search spaces 
as in the standard SSE. 

The final approximation for S(0) is therefore a sequence 
Z'{0) of length Ah, where 

Z'(0) = Z«(0)Z«(0)z( 2 )(0)zf (0) (17) 

We call this technique recursive search space expansion 
(recursive SSE), given in pseudocode form in Algorithm 
2. We have now expanded the lookup space much further 
than was possible with the original Solovay-Kitaev algo- 
rithm, allowing for a much denser search of the space of 
possible approximating sequences. The structure of the 
algorithm remains unchanged, and as a consequence, so 
too does the scaling of the accuracy of the approxima- 
tion with the length of the sequences. The length I of a 



5 



Algorithm 2: Recursive_Space .Expansion 



5 
6 
7 
8 
9 
10 
11 



Input 
Input 
Input 
Input 
Input 
Input 

Input: 

Input: 



G G SU(2): Target of approximation 
S: Universal set of instructions 

So: Stored set of instruction sequences of gates from S of length lo 
Si: Stored set of instruction sequences of gates from S of length li 
eo : Accuracy of initial approximation 

ti : Parameter for Space_Expansion function, indicates the accuracy of initial approximation for the called 
Space-Expansion function 

ei: Parameter for Space_Expansion function, indicates accuracy of loop-internal approximation for the called 
Space_Expansion function 

k: Cardinality of the set of sequences that will be requested from Space-Expansion k > 1 



Output: Approximating sequence for G of length 41 1 

Recursive_Space_Expansion (G, S, So, Si, eo, to, k) 
R {r 6 So\ distance(r, G) < eo}; 
foreach r G R do 

Split r into 2 sub-sequences of the same length — , called r plc and r su f ; 

Ri 4— {ri G Space_Expansion(r pre , S, So, Si, ei, ei, k)}; 
Ri 4— {r-2 G Space_Expansion(r su f, S, So, Si, ei, ei, fc)}; 
Join Ri and R2 to have the following set: 
-R3 {riT2\ ri G -Ri and r-z G R2}; 
end 

r 3 -s— The best approximation for G in R^\ 
return r$ 



sequence for accuracy e is still given by 
l = 0(log c (l/e)) 



(18) 



with c f=a 3.97. However, this technique should signifi- 
cantly reduce the pref actor in scaling. 



INCREASING LOOKUP EFFICIENCY USING 
GEOMETRIC SEARCH 

The technique we have just described is very powerful 
in extending the set of searched sequences without re- 
quiring an exhaustive search over all possible approxi- 
mations. However, there is an additional search cost for 
these methods that is not present in the Solovay-Kitaev 
algorithm as commonly used. 

Firstly, each time SSE is invoked (either on its own or 
as part of a recursive SSE step), the space of sequences 
of length Iq needs to be searched to find the regions 
of sequences that are distance eo away from the subse- 
quences 5' 1,2 '(0). Each time SSE is used, an additional 
two searches are required to find the desired regions. Sec- 
ondly, whenever sequences are combined to form a longer 
approximation to G, the list of combined sequences needs 
to be searched to find the one that is closest to G. This 
happens once per use of SSE or recursive SSE. 
The first of these is by far the largest cost in our tech- 
nique, as it is can occur many times in a given use of the 
decomposition algorithm. The second is only incurred 
once per level of recursion in the standard algorithm. We 
can keep the second cost tolerable by not increasing the 



number of times SSE is used recursively on itself - for this 
reason, we describe recursive SSE as only splitting the se- 
quence twice; any more, and the search cost to find the 
best Z(0) (equation 17) would be prohibitive. Without 



this restriction, we could use recursive SSE many times 
on itself to find sequences of arbitrary length, and have 
no need for the structure of the original Solovay-Kitaev 
algorithm. However, given the exponentially increasing 
cost of this search, we chose instead to restrict recursive 
SSE to two applications of SSE, and to then proceed to 
the next level of Solovay-Kitaev recursion if further ac- 
curacy in the decomposition is required. 
In order for our techniques to be useful in feasible com- 
putational time, we need to find a way of performing 
the region-finding search in SSE efficiently. The most 
straightforward way is to search over the entire space 
of sequences up to length Iq and pick out those within 
distance eo. Such a linear search is, however, very ineffi- 
cient: in general, the search time will be exponential in 
Iq. However, we are searching a very structured space 
and should be able to make use of this structure in order 
to increase the efficiency of this search step. We will show 
now how to convert our matrix search problem into a 3D 
geometric search problem, and how we can then use the 
existing technique of geometric nearest-neighbour access 
trees (GNATs) to solve the search problem much more 
efficiently. We can use such a geometric technique in the 
original Dawson and Nielsen algorithm for the decom- 
position as well, and in the next section we will use it 
instead of a linear search when comparing our SSE tech- 
niques with the original algorithm. More sophisticated 
search techniques at this step are beginning to be devel- 
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FIG. 2: A simple GNAT with clusters. The points with 
surrounding circles are the splitting points of the same 
level clustering. 



oped, including the database searches of [14] and [15]. A 
further alternative use of geometric search techniques for 
the Dawson-Nielsen algorithm has been examined simul- 
taneously, and came to our attention after our work was 
completed [TB"] . 

We can convert any matrix SU(2) into a unique vector in 
a ball in a 3D real vector space of radius 2n and centred 
on the origin [171 Ch.5]. We can uniquely write any U € 
SU{2) as 



U = c~ 



u(v) 



(19) 



where v £ K and a is formed of the Pauli matrices, 




(20) 



We therefore can fully and uniquely specify the matrix 
U by specifying the vector v. The distance function 
that we are using to search is now no longer the trace 
norm between two matrices U\,U2 1 but rather the 
norm between the two specifying vectors given by the 
Euclidean distance function. 

We therefore have a 3D geometric search problem of find- 
ing vectors within the ball of radius 27re centred on the 
vector corresponding to the matrix S'^ 1,2 ^(0). Such geo- 
metric searches in real space have been extensively stud- 
ied, and we can therefore pick from the existing tech- 
niques the one that best suits our purposes. 



GNAT search 

The standard method for increasing the efficiency of a 
real-space search is to use a tree-based approach. By di- 
viding the search space into clusters, tree-based searches 
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FIG. 3: Time required: GNAT vs. linear search, for 
stored sequence databases with Iq = 16, 17, 18. 



are capable of reducing the computational complexity 
of a search over n entries from 0(n) in the case of the 
straightforward linear search to 0(log(n)) at least in the 
best or average case. The increasing efficiency comes 
from the ability of the search algorithms to skip entire 
clusters that are evidently not going to contain a cor- 
rect answer. Amongst the most popular approaches are 
those of R-trees and k-d trees jTSJ US] ■ These approaches 
divide the search space into clusters specified by coor- 
dinates in the space. These approaches were tried, and 
did not yield improved search results. Instead, we found 
that geometric nearest neighbour access trees (GNATs) 
are very effective. 

In a GNAT search, the space is partitioned into clusters 
around a number of fixed points called splitting points [9] . 
How best to choose these splitting points is an open re- 
search question; we use the common approach of picking 
them at random. At the initial stage of processing, each 
vector in the space is then analysed in turn to find its 
closest splitting point, and the distance from this point. 
The set of vectors that are closest to a given splitting 
point is the cluster associated with that point. The clus- 
ter size can be defined by the greatest and least distance 
from all vectors in the cluster to the splitting point. Each 
cluster, if desired, can in turn be partitioned using split- 
ting points to produce sub-clusters. This procedure can 
be applied recursively as many times as desired, until the 
size of each cluster is small enough that a linear search 
within the cluster is feasible. Figure [2] shows clusters 
around splitting points in a 2D space. 
Calculating and storing the data associated with each 
point (which cluster it belongs to, and how far away it is 
from the splitting point) and each cluster (maximum and 
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minimum distance between splitting point and all data 
points in the cluster) is the most computationally expen- 
sive step in the search procedure. However, when we use 
this procedure in our modified Solovay-Kitaev algorithm, 
we only need perform this step a single time for each set 
of library gates. This can be performed offline before the 
algorithm begins, and the structured data can be re-used 
for any decomposition problem using that library set. 

The search proceeds as follows. Suppose we are searching 
for all the nearest neighbours {p} that are within distance 
e of a given query point x, ie D(jp, x) < e where D is our 
distance function. We start by looking at the top-level 
clusters, with splitting points Sj. The distance between 
the splitting point and the query point is D(si,x) = di- 
By then applying the triangle inequality, we can see that 
the points {p} can only belong to clusters where the dis- 
tance between the point p and its splitting point satisfies 

d,-e< D{p, Si ) <d, + e (21) 



All other clusters can be rejected, and the search at the 
next level concentrated on those that remain. 

We can see how GNAT significantly improves the search 
time over a linear search. Figure|3]shows the average time 
in ms to find the e-region around a query point for data 
sets with different volumes. These data are the average 
over 120 runs for each sized search space, choosing differ- 
ent query points each time. Note in particular that the 
time axis is on a log scale - GNAT clearly outperforms 
linear search by two orders of magnitude. 



COMPARISON WITH THE ORIGINAL 
ALGORITHM 

In order to test the modifications to the Solovay-Kitaev 
algorithm, we implemented the original, original+SSE, 
and original+recursive SSE algorithms to find the length 
of gate sequences generated for a given gate and level 
of accuracy. A set of 25 different randomly-generated 
matrices in SU (2) was generated, and then approxima- 
tions found for each matrix using the three different al- 
gorithms. The library gate set used was the minimal set 
{H, T, T^}, and the stored sequences were of length up 
to lo = 18. For each gate the best approximation at each 
of the first four levels of Solovay-Kitaev recursion was 
recorded. In all cases, GNAT search was used to find 
e-regions in the space of sequences of length up to Iq. 
The largest quantum cost in fault-tolerantly implement- 
ing a sequence approximating the desired gate is, in gen- 
eral, in implementing the T gates (as these require magic 
state injection). We therefore take as the appropriate 
measure of length for an approximating sequence the 
number of T gates in the sequence. Figure [4] shows the 
length (number of T gates) vs. the accuracy of the ap- 
proximation for the best approximating sequence found 
by each algorithm, for each of the 25 randomly generated 
unitary matrices. The step-like behaviour in all three 
cases is caused by the recursion levels of the original al- 
gorithm. Both SSE and recursive SSE are significant 
improvements over the original algorithm, with recursive 
SSE clearly the better of the two. In both cases the 
length of sequences for a given approximation accuracy 
is reduced - for example, when the accuracy required is 
around 10 -4 then SSE alone reduces the length of the 



8 



best sequence by a factor of three, and recursive SSE re- 
duces it by a factor of seven. The number of levels of 
Solovay-Kitaev recursion also reduces significantly, from 
n = 4 using the original algorithm to n = 3 with SSE 
and to n = 2 with recursive SSE. 

Recursive SSE is therefore clearly better than the stan- 
dard Solovay-Kitaev algorithm at producing gate se- 
quences that cost less to implement in terms of quan- 
tum resources on a quantum computer. However, this is 
not the only consideration: we must also take into ac- 
count the classical processing time needed to find the 
best approximating sequence in each of the cases. Fig- 
ure [5] shows this classical pre-processing time vs. accu- 
racy of the resulting approximating sequence. As the 
two plots are of the same dataset, they may be directly 
compared through their x-axes. As we would expect, the 
more computationally intensive recursive SSE algorithm 
takes a much longer time to run. However, note the log- 
scale on the time axis; the curve for recursive SSE is in 
fact sub-exponential in its scaling. 

We can therefore conclude that there is a straightfor- 
ward trade-off between reducing the quantum cost of im- 
plementing a given single-qubit unitary, and the classi- 
cal pre-processing time to find the sequence. In general, 
given the relative state of the two technologies, we would 
prioritize decreasing the quantum cost at the expense 
of classical processing. It is also important to bear in 
mind that sequence-finding can be run offline, before the 
algorithm starts, whereas implementing the approximat- 
ing sequence in terms of quantum gates is by definition 
online. We therefore have an algorithm that gives us a 
shorter approximating sequence, using classical process- 
ing that scales less strongly than [5]. This is therefore 
a middle-ground between such a flat, linear search, and 
the structured, sparse search of the standard recursive 
Solovay-Kitaev algorithm. 

CONCLUSION 

We have given a modified version of the Solovay-Kitaev 
algorithm that greatly reduces the length of the se- 
quences used to approximate a unitary single-qubit gate. 
Our technique also reduces the number of levels of re- 
cursion required by the algorithm to reach a given level 
of accuracy. By reducing the depth of the quantum 
circuit used to approximate a given gate, we are then 
able to reduce the amount of error correction needed for 
fault-tolerant implementations of quantum algorithms. 
The cost for this is an increase in the classical process- 
ing needed to find these shorter and less costly quan- 
tum sequences, but the use of structured GNAT searches 
enables this to be performed in time that scales sub- 
exponentially (and can also be performed offline before 
the quantum algorithm starts). By increasing the space 
that is searched at the initial level of recursion in the orig- 



inal algorithm, we are able to use the powerful method 
of the recursive steps, but without leaving so much of the 
search space unexplored between levels of recursion. 
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